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Foreword 


This  collection  of  notes  is  a b;-product  of  psychological 
investigations.  The  piorpose  of  these  studies  was  to  assess 
human  perfomance  in  information  processing.  In  particular, 
it  has  been  atten^ted  to  measxire  hximan  capabilities  of  ac- 
quiring  and  transmitting  information.  Measurements  were  ob- 
tained in  terms  of  the  Shannon-Wiener  Measure  of  Information, 
and  the  related  Measure  of  Transmission  Rate.  This  implies 
categorization  of  all  stimuli  and  responses,  and  estimation 
of  the  probabilities  of  occurrences  for  all  possible  associ- 
ationib  of  stimvili  and  responses.  In  many  engineering  applica- 
tions, the  nvunber  of  categories  is  low;  in  psychological  ex- 
periments, it  tends  to  be  high.  For  instance,  in  a letter- 
recognition  experiment,  there  are  26  possible  Inputs  and  out- 
puts, and  676  stimulus -response  pairs.  Furthermore,  the  proba- 
bility of  a given  answer  to  a given  stimulus  depends  also  on 
preceding  and  simultaneous  (neighboring)  other  stimuli  and 
responses^  thus,  the  number  of  distinguishable  categories  be- 
comes very  large.  In  order  to  associate  a probability  measure 
with  every  single  category,  a large  sample  is  needed;  the 
greater  the  precision  required,  the  lEU>ger  the  sample.  This 
can  lead  to  an  Inordinate  amount  of  labor  in  data  taking  and 
con5)uting.  Moreover,  it  seems  that  one  cannot  reach  arbitrarily 
high  precision  by  extending  the  observation  to  great  length; 
it  is  likely  that  during  a long  series  of  trials,  and  partly 
as  a consequence  of  such  trials,  the  underlying  probabilities 
remain  not  constant.  Thus,  it  is  more  than  a convenience  to 


ii 


replace  the  exact  computation  with  approximating  :jhortcuta# 
based  on  samples  of  moderate  size. 

In  this  laboratory.  In  dealing  with  specific  aspects  of 
these  problems  which  arose  from  experimental  studies j we  have 
tried  to  obtain  solutions  of  slightly  greater  generality  than 
needed  in  the  particular  instance*  The  result  are  a number 
of  teciuilques  which  have  worked  in  some  cases,  and  may  be  ex- 
pected to  be  useful  in  others;  they  are  presented  in  this 
report* 


Henry  Quastler 


ill 


NOTES  FOR 

H»  Quastler 
H.  Quastler 

A.  A.  Blank 
H.  Quastler 
A.  A.  Blank 
A.  A*  Blank 

H.  Quastler 


THE  ESTIMATION  OP  INFORMATION  MEASURES 
Table  of  Contents 


"Information  Measures  in  Incompletely 
Known  Situations" 

"Equivocation  as  the  Sum  of  Error- 
Locating  and  Error-Correcting 
Information" 

"Upper  Bounds  for  the  Equivocation" 
"Remarks  about  the  Method  of  Hints" 

"A  Method  of  Hints" 

"The  Uncertainty  Measure  for  Quantized 
Normal  Distributions" 

"Different  Methods  Compared" 


Incompletely  Known  Situations  1 


INFORMATION  MEASURES  IN  INCOMPLETELY  KNOWN  SITUATIONS 

Henry  Quastler 

The  oatimation  of  Information  (or  uncertainty,  specifi- 
city, entropy),  H,  involves  the  following  three  operations: 

(1)  the  classification  of  all  relevant  occurrences.  (2)  the 
estimation  of  the  probability  associated  with  each  class,  and 
(3)  the  computation  of  an  information  function  based  on  these 
estimates*  In  the  ideal  case,  if  it  is  known  that  there  are 
exactly  r classes,  and  that  the  probabilities  are  p(l)  (where 
i=l,  2,,,rj^  p(i)=l);  then,  the  uncertainty,  H,  is 
defined  by  the  well-known  equation 

H * - ^ p(i)  log  p(l). 

In  most  actual  situations,  it  is  Inipossible  to  know  all  the 
alternative's  or  to  assess  accurately  the  probabilities  associ- 
ated with  every  single  one.  This  could  seriously  in5>air  the 
practical  value  of  Information  measures;  it  is  the  purpose  of 
this  note  to  show  that  it  does  not.  It  will  be  demonstrated 
that  a rough  estimate  of  H 1s  possible  as  soon  as  the  major 

alternatives  and  their  approximate  probabilities  are  known* 
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The  measure  of  specificity  is  a sum  of  terms  "p  log  i”* 

P 

This  function  rises  steeply  up  to  p = .10,  which  accounts  for 
the  information  measure  being  not  very  sensitive  to  rare  alter- 
natives; and,  it  has  a flat  top  for  values  of,£  between  0.20 
and  0,60,  which  accounts  for  the  small  effect  of  moderate 
fluctuations  in  probability,  (Pig.  1) 

a - Small  Effect  of  Rare  Occurrences 

The  following  examples  will  Illustrate  that  the  measure 
of  specificity  is  not  sensitive  to  rare  alternatives.  To  be- 
gin with  a hypothetical  case:  Suppose  nine  alternatives  are 

known  to  account  for  90^  of  all  occurrences;  if  they  are  equi- 
probable,  then  their  contribution  to  the  measure  of  specificity 
Is  9 X ^ logg  10  = 2,99.  We  now  fill  ths  remaining  10^  with 
a varying  number  of  equlprobable  alternatives;  the  results  are 
tabulated  below: 


TABLE  I: 

Effect  on  Uncertainty  of  Minor  Alternatives  (adding  up  to  1.0% 
of  all  occtirrences ). 


No.  of 
Minor 
Alter- 
natives 
respons- 
ible for 
residual 
1051^ 

Proba- 

bility 

of 

each 

Aggre- 
gate 
Contri- 
bution 
to  H 

Total  H if 
90^  of” oc- 
CTirrences 
made  up  by 
9 equipro- 
bable  alter- 
natives 

No,  of  equi- 
probable  al- 
ternatives 
giving  same 
total  H 

(Pig.  2a)  1 

1/10 

.33 

3.32 

10 

( * 2b)10 

1/100 

,66 

3.65 

13 

*1  c^rs 

1/lOGO 

1.00 

3.99 

16 

10,000 

1/100,000 

1.66 

4.65 

25 

1,000,000 
— ■— 1 

1/10,000,000 

2.33 

5.31 

41 

1 

In  this  situation.  If  one  tinderestlmates  the  number  of  minor 


alternatives  by  a factor  of  100,  the  resulting  error  In  H Is 
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only  about  one-sixth.  If  only  of  all  occurrences  are  to 
be  filled  by  unknown  minor  alternatives,  their  contribution 
is  even  less  conspicuous: 


TABLE  II: 

Contributions  to  Total  Uncertainty  by  Minor  Alternatives 
(adding  up  to  of  all  occwrences) 


No,  of  minor 
alternatives 

Probability  of  each 

Contribution  to  H 

1 

.22 

10 

5/1000 

.38 

1000 

Vio,ooo 

.55 

10,000 

VI » 000,  000 

.88 

1,000,000 

5/1,000,000,000 

1.21 

The  difference  between  the  extreme  values  in  this  table  can 
be  made  Intuitively  clear  by  expressing  vincertainties  in  terms 
of  equivalent  n\un.ber  of  equiprobable  alternatives  (as  shown 
in  the  table  I),  Suppose  that  this  nximber  is  known  for  a 
single  alternative  accounting  for  of  the  occtarrences,  the 
other  95^  being  distributed  in  any  arbitrary  fashion;  then, 
if  the  single  alternative  is  replaced  by  one  million  equi- 
probable ones,  without  otherwise  changing  the  distribution, 
the  equivalent  number  of  equiprobable  alternatives  is  just 
doubled. 

An  Investigation  by  A.  A,  Blank  furnished  an  Impressive 
real  example.  He  calculated  the  specificity  of  single  English 
words;  for  particular  reasons,  the  sample  was  restricted  to 
{^i-letter  words.  The  uncertainty  v;as  obtained  as 

= -Ti  (N=Z\) 
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where is  the  observed  frequency  of  the  i’th  l|.-letter  word 
in  the  Thorndyke  list.  He  also  determined  the  values  of  H 
obtained  by  successive  elimination  of  the  less  frequent 
words.  The  results  are  shown  in  table  III; 

TABLE  III; 

Measure  of  Uncertainty  for  l|.-letter  words  (data  of  A,  A,  Blank) 


no. 

of  words 

% of  all  words 

H 

All  4-“letter  words 
in  Thorndyke* a list 

1^50 

100.0 

1 8.13 

100 

Only  words  with 
frequency  ^ l50 

865 

55.8 

7.98 

98.1 

Only  words  with 
frequency  75o 

395 

25.5 

7.4-7 

91.8 

Only  words  with 
frequency  ^ l5^0 

214. 

13.8 

6,89 

84-.  8 

Only  words  with 
frequency  31^0 

119 

7.7 

6.34- 

77.8 

Thus,  taking  into  consideration  only  l/lO  of  all  categories 
(which  probably  accovint  for  more  than  l/2  of  all  occtirrences) 
yields  already  about  4/5  of  the  final  measure  of  specificity. 


The  examples  given  show  that  the  information  function  is 
not  sensitive  to  rare  occurrences  - which  means  that  it  should 
not  be  used  whenever  infroqxxent  occurrences  must  be  heavily 
weighed;  on  the  other  hand,  it  can  be  used  successfully  in 
situations  which  are  not  completely  known, 

b - Small  Effect  of  Small  Variations  in  Probability 
Any  operation  which  tends  to  average  probabilities  in- 
creases the  uncertainty;  therefore,  if  r alternatives  have 
an  aggregate  probability  P then  their  contribution  to  the 
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measure  of  uncertainty  Is  greatest  if  they  have  all  equal  pro- 
babilities (P/r)»  We  will  now  demonstrate  that  moderate  de- 
viations from  equiprobabllity  do  not  markedly  affect  the  un- 
certainty* 

Consider  the  simplest  case  of  P = 1 and  r = 2*  If  the 
two  probabilities  are  equal,  then  H = Ij  if  their  ratio  is 
1:2,  H = 0*92;  if  the  ratio  is  1:3*  a very  considerable  de- 
viation from  equality,  H is  still  0*8l* 

For  larger  values  of  r,  the  insensitivity  of  H against 
probability  distortion  is  still  more  pronounced.  For  instance, 
we  may  replace  the  9 equiprobable  major  alternatives  in  our 
first  example  (table  I)  by  sets  of  9 aliornatives  with  pro- 
babilities staggered  arithmetically  (fig.  3a)  or  geometrically 
(fig*  3h)*  stipulating  only  that  the  span  between  the  extreme 
values  should  be  within  one  order  of  magnitude.  The  resulting 
changes  in  H are  quite  small. 

We  come  to  the  following  conclusions  if  a situation  is 
analyzed  to  the  degree  that  we  feel  we  can  classify  90-95^ 
of  all  occurrences;  and  if  the  probabilities  associated  with 
each  class  are  approximately  known;  then  we  are  entitles  to 
make  a rough  estimate  of  the  information  measure. 
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EQUIVOCATION  AS  THE  SUM  OP  ERROR-LOCATING 
AND  ERROR-CORRECTING  INFORMATION 

H.  Quastler 

The  estimation  of  information  transmission  is  based  on 
the  estimation  of  probabilities  for  all  possible  input-output 
pairs.  There  are  many  cases,  in  particular  those  where  human 
performances  are  to  be  assessed,  where  there  is  a host  of  In- 
put-output categories,  more  than  one  could  ever  hope  to  fill 
adequately  by  means  of  the  usual  experimental  sampling  proce- 
dures, and  more  than  would  tax  the  resotirces  of  a small  battery 
of  computers.  For  that  reason.  It  is  useful  to  have  bovmds 
for  information  transmission  and  equivocation.  In  case  of 
doubt,  the  bound  should  be  on  the  conservative  side  (which 
means  a lower  botind  for  transmission,  an  tqpper  botuid  for  equi- 
vocation). Ideally,  an  efficient  bound  is  sought.  This  means 
that  if  one  does  make  suitable  allowance  for  all  possible  con- 
tingencies in  the  input-output  table,  the  bo\uid  should  be 
equal  ' the  value  sought.  Such  a bound  for  the  equivocation 
can  bo  constructed  as  follows:  we  imagine  an  a\ixlllary  source 

(an  "ideal  observer"  ) which,  knowing  both  input  and  output, 
furnishes  such  information  as  is  needed  to  reconstruct  the  in- 
put from  the  output.  We  will  show  that  if  most  efficient  cod- 
ing is  used,  the  amoxint  of  information  produced  by  the  » 

iary  source  becomes  the  equivocation. 

Let  H(in),  H(out),  and  H(aux)  denote  the  uncertainties 
(per  xmit  act)  of  input,  output,  and  auxiliary  soxarce,  re- 
spectively. Subscripted  symbols  denote  conditional  vincer- 
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tainties.  Then  (Shannon,  p.  37 )» 

®out,aux^ ^ ” H(aux) 

If  it  is  possible  to  reconstruct  the  input  con5>letely  from 
the  output  and  the  axixiliary  message,  then  van- 

ishes and 

H(aux)  > H^^^dn) 


which  gives  the  \:q>per  bound  desired.  It  has  been  shown  (Shan- 
non's theorem  10,  p.  37)  that  It  Is  possible  to  approximate 
H^ut^i*^)  H(aux)  as  closely  as  it  is  desired  by  means  of  ef- 
ficient coding.  More  precisely,  it  is  possible  to  correct  all 
but  an  arbitrarily  small  amount  e of  the  errors  in  the  output 
with  a channel  of  capacity  HQ^^(in).  For  such  an  efficient 
code  we  have  necessarily 


=out<  In)  ^ H(a\ix  ) > - ‘ 


It  follows  that  the  uncertainty  of  a fully-correcting  auxil- 
iary source  is  an  efficient  bo\ind  as  desired. 

Shannon's  theorem  states  what  could  be  ideally  obtained 
with  perfect  coding;  it  does  not  say  how  such  a code  is  to  be 
constructed.  In  the  sitxiation  we  are  considering,  the  auxil- 
iary source  wo\ild  have  to  be  designed  with  perfect  knowledge 
of  the  properties  of  the  input  and  of  the  channel.  But,  it 
is  precisely  such  knowledge  which  we  try  to  establish  by  ex- 
perimental tests;  hence,  we  cannot  establish  an  optimum  be- 
havolr  for  the  ideal  observer.  What  we  can  actually  do  is  the 
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following:  we  begin  by  designing  an  auxiliary  source  whlclx 

will  make  the  equivocation  vanish.  The  Information  furnished 
by  this  source  Is  necessarily  an  upper  bound  of  the  amotuit  of 
equivocation.  Subsequently,  we  use  any  knowledge  we  may  have 
about  regularities  In  the  occurrence  of  errors  to  reduce  the 
amount  of  Information  required  from  the  auxiliary  source. 

This  can  be  done  In  successive  steps.  In  addition,  we  can 
try  to  name  lower  bounds  for  the  amount  of  information  required 
from  the  auxiliary  source;  this.  If  successful,  will  bracket 
the  true  value  of  the  equivocation. 

It  Is  sometimes  convenient  to  partition  the  auxiliary 
soxirce  Into  two  sources,  one  which  furnishes  data  needed  to 
locate  errors,  and  one  which  servos  to  correct  errors  after 
they  are  located.  Accordingly,  the  amount  of  equivocation  Is 
partitioned  Into  error-locating  Information  of  amount  H(loc), 
and  error-correcting  information  of  amount  H(cor): 

Ho^^(in)  e H(loc)  ♦ H(cor) 

The  amount  of  information  needed  for  error  location  de- 
pends on  the  pattern  of  making  errors.  If  this  is  a lawful 
and  known  pattern,  then  no  error-locating  Information  is  needed. 
Suppose,  for  Instance,  that  S makes  a mistake  exactly  every 
l|.th  time;  then  to  know  the  location  of  all  errors  one  has  to 
know  only  the  location  of  any  one  false  transmission  (a  negli- 
gible amottnt  of  Information  con^ared  to  the  total  information 
In  a long  message).  Or,  If  It  Is  known  that  S Is  always 
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wrong  when  his  outputs  are,  say,  of  the  kind  '*M,”  ”P,"  or 
then  no  extra  information  will  be  needed  to  disregard  these 
parts  of  his  report.  If,  on  the  other  hand,  the  commission 
of  an  error  is  a random  event,  then  some  information  is  needed 
to  locate  it*  Suppose,  now,  that  the  total  ensemble  of  errors 
committed  includes  errors  which  are  quite  lawful  and  predic- 
table (provided  the  laws  are  known)  and  others  which  are  ran- 
dom events*  As  only  the  latter  need  any  error-locating  infor- 
mation, it  follows  that  the  larger  the  fraction  of  random  er- 
rors, the  greater  the  amount  needed  for  error  location. 

If,  on  the  basis  of  prior  knowledge  of  the  S<s  behavior, 
one  can  extract  from  the  output  itself  any  indications  con- 
cerning error  location,  then  these  indications  can  and  should 
be  used  in  constructing  the  error  locating  code.  If  it  is 
known  that  all  outputs  are  equally  liable  to  bo  erroneous,  then 
the  amount  of  the  error  locating  information  is  maximum. 
Therefore,  if  one  does  not  know  how  errors  are  distributed  in 
the  output,  he  may  assume  them  to  be  equlprobable  suid  thus  ob- 
tain an  upper  bound  of  the  amount  of  information  needed  to 
siipplement  the  S*s  report  in  order  to  locate  the  errors  it 
contains.  Deviations  from  equal  distribution  of  errors  have 
to  be  quite  pronoxinced  before  they  reduce  the  value  of  the 
supplementary  information  much  below  the  maximum. 

The  following  example  is  taken  from  experiments  performed 
in  this  laboratory  (Quastler  and  Wulff).  Ss  had  the  task  of 
copying  sequences  of  random  letters  on  the  typewriter*  In  a 
particular  sanple  of  10,260  letters,  we  found  3*4)^  errors* 
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In  first  approxlxnatlon*  we  assume  that  errors  occur  cosqpletely 
randomly;  with  this  model,  we  get: 

H(loc)  * - 0*034  logo  0*034  * 0,966  logo  0.966  « 0*214  bits/ 

letters 

However,  It  was  quite  obvious  that  clusters  of  errors 
occurred  more  frequently  than  could  bo  expected  by  chance 
grouping*  Thus,  in  one  particular  sanq[>le  of  10,260  letters 
we  found: 


single  errors  * 

*.*  158 

times. 

or 

1.54^ 

pairs  • . . « • 

. . 42 

II 

n 

0,415^ 

triplets  • . . . 

> * * 22 

n 

n 

0,21% 

quadruplets  * • 

. * 9 

n 

II 

0,09% 

quintuplets  * * 

. * 1 

N 

n 

0,01% 

Thus,  while  the  over-all  probability  of  any  key  being 
wrong  was  the  probability  of  a wrong  key  following  a 

wrong  key  was  33*5^*  or  almost  ten  times  as  large.  Accord- 
ingly, we  Introduce  a second  approximation,  and  treat  the 
error -generating  mechanism  as  a Harkov  process,  with  the  pro- 
bability of  an  error  occurrence  depending  on  (and  only  on) 
the  success  or  failure  of  the  preceding  act*  The  Markov  pro- 
cess Is  characterized  by  the  piTobabilltles; 

Prob  ■[  success  follovring  success^  = 0,977 
Prob  ^ error  following  success  } * 0*023 

Prob  ^success  following  error  ^ = 0.66^ 

Prob  'l^error  following  error  ^ « 0*335 
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Prom  these  probabilities  follows: 

Hj^(loc)  *=  - 0.03U>c(0.335  log2  0.335  + 0.665  logg  0.665)  - 

0.966  X (0.023  log£  0.023  + 0.977  log2  0.977)  * 0.18^  bits/ 

key 

ThuSf  this  approximation  reduces  the  error-location  uncertain- 
ty by  about  one-sixth.  Additional  refinements  are  suggested 
by  the  data,  but  will  not  result  in  any  significant  reduction 
of  uncertainty;  e.g.,  introduction  of  error  probabilities  con- 
ditional upon  the  two  preceding  acts,  gives  a value  H2(loc) 
which  Is  0.185  blts/key,  thus,  not  smaller  than  the  value  for 
Hj^(loc). 

The  amount  of  information  needed  to  correct  errors  depends 
equally  on  the  error  pattern.  A given  output,  even  if  known 
to  be  erroneovLs,  might  still  s\iggest  a limited  range  of  pos- 
sible inputs.  For  Instance,  in  playing  piano,  an  error  is 
likely  to  be  not  very  far  from  the  target  key.  We  have  used 
this  consideration  to  bracket  HQ^|.(ln)  between  values  obtained 
by  assvimlng  error  ranges  which  are  certainly  too  large  and  too 
small,  respectively.  If  no  regularities  in  the  error  pattern 
are  known,  then  all  one  knows  about  the  input  is  that  it  is 
not  the  output;  in  the  special  case,  frequently  occurring  in 
the  laboratory,  where  one  has  k equlprobable  input  categories, 
end  each  input  is  equally  likely  to  produce  a faulty  output, 
we  have 

H(cor)  * g log£  (k-1) 


In  the  binary  case,  H(eor)  vanishes* 
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Tha  method  hare  discussed  is  useful  in  two  ways.  First# 
it  establishes  the  relations  between  error  rate,  error  pattern, 
and  information  transmission  in  a fashion  which  is  lucid  and 
easy  to  survey;  thus,  we  found  it  convenient  to  use  for  a first 
coiT5)Utation  of  T in  situations  not  previously  dealt  with.  Se- 
cond, the  method  enables  one  to  perform  an  estimation  of  T in 
successive  steps;  it  establishes  upper  bounds  for  the  equivo- 
cation which  together  with  the  lower  boxmd  (zero)  define  an 
interval  within  which  T must  lie;  any  knowledge  about  error 
pattern  can  then  be  used  to  narrow  down  the  Interval. 


Reference: 

Shannon,  C.  and  W.  Weaver  The  Mathematical  Theory  of 
Communication.  University  of  Illinois  Press,  tTrb'ana, 
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UPPER  BOUNDS  FOR  THE  EQUIVOCATION 
A.  A*  Blank 

Shannon  defines  Information  transmitted,  T(ln,out),  as 
T(in,out)  * H(in)  - B^^^dn) 


where  H(ln)  Is  the  uncertainty  of  the  sotirce  and  H^^^Cin)  is 
the  equivocation,  or  the  imcertalnty  as  to  the  sotirce  at  the 
receiving  end  of  a channel.  Set  p^j  = joint 

probability  that  the  1-th  input  category  is  coupled  with  the 


j-th  output  category.  We  set 

“ f Pij* 


13  “ 2 pij 


pU|i)  - ^ 


i(tU)  “ . 


H(ln)  « - £ Pi  log  Pi 

Hg^^(ln)  = - £ qj  ^ q(i|j)  log  q(l|J). 

The  quantity  H^(ln)  = £ q(l|j}  log  q(l|j)  is  called 
•'  1 

the  equivocation  whan  the  j-th  output  occurs  or  the  uncertainty 
as  to  the  source  with  respect  to  the  j-th  output.  If  the  pur- 
pose of  the  channel  is  to  duplicate  the  input,  that  is,  to 
couple  an  input  in  the  1-th  category  with  an  output  in  the 
same  category,  it  is  desirable  to  locate  and  correct  errors. 

The  quantity  Hj(in}  may  be  represented  as  a simi 

Hj(ln)  *=  Hj(loc)  + Hj(cor) 


■ 
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Her* 


Hj(loo)  e - q(jlj)  log  q(j|j) 

- (i-q(JlJ)]* 


Hj(loc)  is  called  the  error*  7.ocating  Information  with  respect 
to  the  J-th  output  and  represents  the  conditional  uncertainty 
of  a so\irce  which  reports  on  the  truth  or  falsehood  of  an  out- 
put in  the  j-th  category.  The  quantity,  Hj(cor)  is  called  the 
correction  information  with  respect  to  J and  satisfies 

Hj(cor)  . - I log 

(1^1) 


= - £ qUlJ)  log  q(i|j)  ♦ [i-q(j|jQ  log  [i-qUlJ^ 


Hj(cor)  is  the  information  required  to  correct  an  output  on 
the  condition  that  It  falls  In  the  j-th  category. 

As  the  error- locating  Information  associated  with  the 
channel  we  take 

H(loc)  = - p log  p - (1-p)  log  (1-p) 

where  p = ^ PH#  As  the  correction  Information  associated 
with  the  channel  we  take 

H(oor)  . - (1-p)  £ log 
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JL  ^ H(loc)  + H(cor) 


Proof!  *=  - £ ^ q(l|j)  log  q(i|j) 

* " ? ^ qj  q(i|J)  log  q (i|j) 

* - C qj^  q(i|i)  log  q(ifi) 

qdjJ)  log  q(i)j)  + qi*o  log  ^ 
^ Pii^  log  £'  (viO 

^ - p log  p - £ (Pi-Pll)  log  (Pl-Pil). 

since  the  last  term  on  the  right  is  equal  to  H(loc)  ♦ H(cor) 
the  proof  is  con5)lete.  Inequality  1 may  be  replaced  by 
stronger  inequalities  with  respect  to  H(loc)  and  H(cor).  In 
particular 9 it  will  be  shown  that  £qj  Hj  (loc)^  H(loc) 
and  ^ qj  Hj  (cor)  ^ H(cor)* 
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Inequality  2:  Hj(lo;c)  <H(loc) 

Proof; 

Hj(loc)  e - £qj  q(Jjj)  log  q(j|j) 

- ^ log  [I-q(3|jg 

^ J )J)  log  q(J(  J) 

-^qj(T-q(j|j3  log  -^-<ij[i“q(J/J^ 

^*f  Pjj  108 

•f  f ‘V“’33> 

4 - p log  p - (1-p)  log  (1-p)  *=  H(loc), 
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lasquallty  3t  H^(cor)  4 H(cor). 

Th»  proof  of  this  Insquall ty  requires  a pirellminary  result. 


Leimna; 


- £ Xj  log  J . £1  Xi 


log 


provided  only  that  <7“^,  Xi>0,  4 1. 


Proof  of  Leamat 

n 


^ Xj  log 


£x^  log  ^ n 


^ <n  X 


J 


• — * (1-J^lXj)  log  1 
1 


log  [i-2xj  * £ ^ ^2 

^ S 0-.  -J 


£ 0-. 


< iog[i-£xj*  I^S  ^^.xeg 


0. 


since  £iXj^  (log  ~ - log  ^ 

€ 


} <^1 


- ^ X^  log 


^ X.  log  ^ ^1 


we  have 
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4-^  (Pi-Pil)  log  (p^-p^j)  4^  (Pi-Pii)  log  (1-p) 


4 H(loo). 
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The  value  is  over-estimated  by  the  value  Hoy^(ln) 

?=  H(loc)  + H(cor  ).  It  follows,  that 

H(in)  - Hj5^^(in)  4 T(ln;out) 

In  other  words « we  err  on  the  conservative  side  If  we  take 
Kjut(in)  estimate  of  HQ^^(ln).  The  final  result  may 

be  considered  as  an  example  of  the  general  statement  that  any 
source  which  fully  corrects  the  errors  of  transmission  must 
have  an  entropy  no  less  than 
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REMARKS  ABOUT 
THE  METHOD  OP  HINTS 
H*  Quastler 

A subject  receives  a message.  One  wishes  to  establish 
how  much  of  the  message  was  assimilated*  "How  much"  is  taken 
to  mean  "how  many  information  vinits"  (or  "bits").  If  the  S 
is  able  to  transmit  the  message  correctly*  then  he  certainly 
has  assimilated  its  Information  content*  If  his  transmission 
is  only  p^tlally  correct*  then  we  may  give  the  3 hints  con- 
taining some  atuclllary  Information  which  wl?.l  help  him  to  cor- 
rect his  errors  and  reconstruct  the  entire  message*  It  can 
be  shown  that 

[Lnount  of  information]  v /Amount  of  infor^  / Amount  of~] 
L assimilated  J Imation  in  input  I * lixif ormation | 

^ L in  hints  J 

where  all  amounts  of  information  are  meastired  in  the  same 
units  (bits). 

There  are  many  ways  of  giving  hints.  A good  method  will 
be  one  which  fulfills  the  following  conditions: 

(1)  it  permits  accurate  estimation  of  the  amount  of  infor- 
mation contained  in  the  hints* 

(il)  it  does  not  confuse  the  S* 

(Hi)  it  gives  as  little  information  as  possible* 

The  issue  can  be  made  clear  by  reference  to  a familiar 
situation*  A student  is  being  examined;  one  wishes  to  give 
him  credit  for  what  he  knows*  Notoriously*  his  first  answers 
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do  not  reveal  the  full  extent  of  his  knowledge.  The  examiner 
tries  to  help  with  hints.  He  must  try  to  keep  the  actual  In- 
formation In  the  hints  small  (otherwise  the  student  could  not 
bo  given  credit  for  the  answer)  aai  not  to^-confuso  the  student. 
The  requirement  dealing  with  numerical  estimation  does  not 
apply*  since  no  quantitative  estimation  of  the  knowledge  is 
obtained. 

The  method  of  hints  is  analogous  to  an  often-used  method 
of  estimating  amounts  of  information  transmitted; 

[Amount  of~\  [Amount  of~l  iLnount  of  information! 

tinformatlonl  llnformatlonl  **  \ needed  to  locate  | 

|transmltte^  in  input  J land  correct  errors  J 


Butt  the  two  are  not  identical.  It  is  true  that  the  hints 
sv^ply  "information  needed  to  locate  and  correct  errors";  how- 
evert  in  addition  to  this  information  S may  use  information 
stored  in  his  memory  but  not  utilized  in  his  first  statement. 
For  instance,  a situation  like  the  following  might  happen: 
the  display  is  i.  dot  on  a vertical  line;  the  line  is  thought 
to  be  divided  into  intervals  of  equal  length;  3 is  asked. to 
state  which  interval  contains  the  dot.  Suppose  he  makes  errors 
with  an  over-all  probability  ^ and  that,  in  case  of  error,  he 
is  too  high  by  one  interval  with  a probability  of  a,  too  low 
by  one  interval  with  a probability  of  (1-a);  no  other  errors 
occur.  Then 


Eiimotint  of  information 
teded  to  locate  errors, 
per  act  


- q logg  q - (1-q)  logg  (1-q) 
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Imount  of  informa- 
tion needed  to 
[correct  errors, 

I per  act 


- q 12®  logg  a 4 (1-a)  logg  (1-a)^ 


These  two  quantities  must  be  deducted  from  the  input  informa- 
tion to  obtain  the  amount  of  information  transmitted*  Suppose, 
now,  that  S has  some  recollection  about  the  direction  of  his 
error;  then,  telling  him  when  he  has  committed  an  error  will 
be  all  he  needs  to  produce  the  correct  statement*  Thxis,  the 
amo\mt  of  information  assimilated  will  be  greater  than  the 


amoimt  of  information  transmitted  (in  the  first  statement)  by 
the  amount  q Q logg  o + (1-a)  q (l-alj^*  Some  experi- 
ences, reported  elsewhere  in  this  collection,  indicate  that 


the  exan^le  given,  while  greatly  sin^llfled,  is  not  unrealistic, 

In  general,  the  estimated  "amount  of  information  assimi- 
lated" may  be  larger  or  smaller  than  the  estimated  "amount  of 
information  transmitted,"  depending  on  the  amoxmt  of  such  re- 
tained information,  on  the  efficiency  of  giving  and  utilizing 
hints,  and  on  the  efficiency  of  the  error  locating  and  correct- 
ing code*  Ordinarily,  we  do  not  expect  the  two  to  differ 
widely  from  each  other* 

The  following  paper,  by  A*  A*  Blank,  gives  a model  of 
performance  for  one  particular  Method  of  Hints*  It  is  the 
only  model  which  has  been  worked  up  in  some  detail,  and  was 
used  on  experimental  data* 
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A METHOD  OP  HINTS' 

A*  A*  Blank 

Let  us  suppose  we  have  a stochastlo  source  of  Independent 
Inputs  aj^  (1  = 1,  and  a response  generator  which  has 

the  conditional  probability  q(l|j)  of  aj^  having  been  emitted 
by  the  source  when  the  response  Is  aj  and  each  response  Is  In- 
dependent of  any  other*  The  uncertainty  as  to  the  source  when 
the  symbol  aj  has  been  reported  is  defined  as 

H (in)  * - ^ q(i)j)  log  q(i|j). 

Suppose  again  that  when  the  receiver  is  in  error  the  fact  of 
the  occurrence  of  error  is  registered  and  fed  back  and  the  res- 
ponse generator  is  then  constrained  to  report  differently*  Ve 
shall  consider  two  possibilities: 

1*  Complete  utilization  of  the  order  of  probability* 

Let  the  conditional  probabilities  of  error  be  ordered 

q(i2|J)  > q(i3|j)  > •••  > qUrlJ) 

where  i2f***»l2,  Is  some  permutation  of  the  indices  excluding  J* 
We  shall  suppose  that,  in  the  event  of  error,  the  response  gen- 
erator will  report  symbols  in  decreasing  order  of  probability 
until  the  correct  one  is  reached*  Lot  Qy(j)  denote  the  fre- 
quency with  which  j is  reported  correctly  at  the  v-th  report* 

We  have 


(v'  « l,.,.,r) 


2 A N«tbo<l  of  Hints 


wtaisns  ws  sot  Cloarly 

Hj(ln)  « - ^ (4v(J)  log  Qy(J). 

Ths  sqnlToestlon  or  unoertalnty  as  to  the  source  Is  given  by 

W<i“)  * - ^ 

J 

Hi'xiest  with  ooBq;)lete  knowledge  of  the  oz*der  of  probability  of 
the  various  errors  we  may  confute  the  equivocation  by  using 
the  method  of  hints  (feed  back  an  error  message)  and  tabulat- 
ing the  probability  for  each  initial  response  of  obtaining  the 
eorreet  report  at  the  v-th  stage* 

2*  laponplete  utilisation  of  the  order  of  probability* 

Imt  us  svq>pose  that  the  generator  reports  in  the  order 
of  deereasing  probability  until  the  k-th  stage  and  that  from 
the  (k  4 l)-th  stage  on  responses  are  chosen  with  eqtaal  proba- 
bility from  the  remaining  alternatives*  In  that  case  set 

^(3)*<lClvU)  v«(l...k) 

P(J)  ■ 1 - i 


We  have* 

Hj(ln) 


k 

¥■1 
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Hj(in)  represents  the  maximm  possible  equivocation  with  res- 
pect to  the  source  if  the  order  of  the  first  k conditional 
probabilities  of  response  are  utilized  in  the  prescribed  man- 
ner, This  estimate  sacrifices  only  knowledge  with  respect 
to  the  rarer  events. 

The  inequality  above  leads  to  the  value  ^ qjH*(x)  as  an 
upper  estimate  for 

The  coB5)Utatlon  of  this  estimate  requires  less  than  the  conq)u- 
tatlon  of  “out  (in)  since  it  requires  k r categories  (k  ==  y x 
^ lc(J)  < r).  Instead  of  r^  categories. 

It  is  not  difficult  to  establish,  with  the  same  data, 
lower  estimates  for  Hj(in)  and  hence  Hout  (in).  We  set 
t k 

Hi(ln)  = - ^ los  Qy(J)  - log  P(J) 

v=l 

k r 

where  P(J)=1-  <^(j)  = < q(lv|J) 

V*=l  V^4-l 

t 

Hj(ln)  represents  the  minimum  equivocation  with  respect  to 
the  soTirce  if  the  first  k responses  are  ordered  according  to 
the  conditional  probabilities  of  initial  response.  Prom  the 
lneq\iallty 

- ^ q(iyi ^ ^ q(iylJ)  log  £ qUvU 

v*=k+l  v*pk'M  v^k+l  ^ 

hJ(^  .)  4 H^^n) 


it  follows  that 
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A more  refined  estimate  Is  sometimes  given  toy 

It 

H,(in)  - 2 Qv(j)  log  Qv(j)  - P(3)  log 
J v*l 


II 

Hj(ln)  approximates  the  minimum  equivocation  with  respect  to 
the  source  if  the  first  k responses  are  ordered  according  to 
the  conditional  probabilities  of  initial  response  and  if  the 
probability  of  correct  response  at  any  stage  does  not  exceed 
that  at  any  preceding  stage.  Prom  the  inequality  qdj^l  J)  ® 
«lc(3)  > qdylj)  V > k,  we  have 

- I qdvd)  log  qdvU)  > - £ qdyU)  log  qdJ  j) 

and  hence • 


TTj(ln)  4 Hj(in) 


If  Q^(J)  < P(J)  then  Hj(ln)  is  a better  estimate  of  Ej(ln) 
Let 


than  H^(ln). 


Ij(ln)  = Max  (Hj(ln),  Hj(in)')  ^ Hj(in), 


where  H. (in)  is  the  better  approximation  to  H.(ln)  of  the  two 

•»  I n J 

lower  bounds  Hj(in)  and  Hj(ln).  We  have 

£ qjHjdn)  ^ I^^^Cin)  ^ £ qjH*{ln)« 


3*  Pooled  data. 

In  some  Instances  it  may  be  impossible  to  obtain  reliable 
values  of  the  frequenoy  of  correct  response  at  the  v-th  stage.. 


k Method  of  Hints  $ 


for  each  Initial  response.  One  nay  only  know  or  be  able  to 
utilize  the  frequency  of  correct  response  at  the  v-th  stage 
taken  over  all  Initial  reports.  This  would  be  the  value 

We  nay  estlnate  H^j^^(ln)  by 

^out^^®^  • - ‘IjHj(ln)  > 

If  no  data  are  obtained  beyond  the  k>th  response  we  may  be 
sure  that  bounded  above  and  below  by 

where 

^ut<^“^  - Qv  ^ r?Tc 

and 

I^tt^(ln)  » Max  ^3j,^^(ln), 

Here 

k 

P » 1 - £ <lv 
v«l 

and 

^A*(ln)  » - ^ log  t log  ? 

r— * 

k 

HjjJ^dn)  » - Qiy  log  Qy  - P log  Qjj. 

The  barred  symbols  represent  a lusqplng  of  response  data  Into 
fewer  categories  and  the  starred  symbols  represent  the  equal 
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division  of  response  data  into  available  categories. 

We  cannot  be  sure  of  the  relationship  of  HQ^^(ln)  to 
H^^^(ln),  but  wo  do  know  that  H^y^(ln)  is  an  overestimate* 

To  obtain  a lower  estimate  of  H^^^Cln)  from  pooled  data 
wo  may  proceed  as  follows; 

We  have  HQ^|.(ln)  * H(ln*out)  - H(out) 

r r 

Now  H(in,out)  ■ - £ p . log  p . 

v=l  3*1 

where 

Pvj  = qj  Q.v(J)* 

Clearly 

PyJ  “ 

r r 

■ jfi  > ■ jli  >’vj  Pvj  ■ - <i»  log  Q, 

Henoe»  . 

Blo^^(ln)  ^ - “2  Qy  log  (iy  - H(out) 

or 

r P ^ 

^ ^ log  Qy  ♦ qj  log  qj  * H^^^(ln)  - 

H(out ) * 

If  we  omit  the  data  for  the  categories  v * k4-l**,.,r*  we  ob- 
tain a still  lower  estimate 

H^y^(in)  > ^ ^ut(i»)  - H(out) 

These  last  estimates  are  very  crude*  Zero  should  often  be  a 
better  lower  estimate  of 
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THE  UNCERTAINTY  MEASURE  FOR  QUANTIZED  NORMAL  DISTRIBUTIONS 

A*  A.  Blank 

Whenever  a discrete  random  variable  may  be  thovight  of  as 
having  its  values  imbedded  In  a continuum  there  is  a sugges- 
tion that  it  may  be  convenient  to  consider  the  distribution 
of  the  discrete  variable  as  the  quantization  of  the  distribu- 
tion of  some  continuous  random  variable.  In  the  usual  game 
of  darts,  for  exan^le,  the  compartment  in  which  the  dart 
sticks  is  a discrete  variable,  but  represents  in  an  obvious- 
way  the  quantization  of  the  error  distribution  in  attempting 
a strike' at  the  bull* a eye. 

In  some  communication  problems  the  discrete  outcomes  aris- 
ing may  also  be  thought  of  as  a qtiantlzatlon  of  a continuous 
distribution  (e.g«,  when  a dial  is  read  to  the  nearest  tenth 
of  a division).  In  Information  theory,  the  uncertainty  fxmc- 
tlon  is  not  defined  in  the  same  way  for  discrete  and  continu- 
ous distributions.  The  uncertainty  for  a discrete  variable 
is  defined  as 

(1)  H »=  - £ pjj  log  pjj 

where  the  value  pj^  denotes  the  probability  that  the  variable 
will  take  its  k-th  value.  In  the  continuous  case,  t^  xancer- 
talnty  is  defined  by  the  integral 

(2)  h’  * - r log  f(^)  d P (£) 

ysi 

where  ^ denotes  the  random  variable,  f (^)  is  its  probabll- 
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Ity  density,  F ( & ) Its  emulative  distribution  function  and 
JTI  is  the  san9>le  space  in  which  ^ varies.  As  immediate 
points  of  difference  between  the  two  definitions  it  will  be 
observed  that  H is  non-negative  while  H*  may  assume  any  real 
■value  whatever. 

!Che  moat  useful  distribution  to  treat  in  this  manner  is 
the  quantized  normal  distribution.  The  normal  density  func- 
tion 2 

7m 

gives 

(U)  H * 7 log  2 'TT  e 6“  ^ 

and  sinoe  may  be  any  non-negative  value  H*  may  assume  any 
value  from  minus  to  plus  infinity.  If  a discrete  distribution 
may  be  Interpreted  as  a quantization  of  a normal  distribution 
it  is  especially  convenient  to  use  (4)  siAoe  all  we  have  to 

I . < 

do  to  confute  H la  to  fizid  o . H will  be  a good  approxima- 
to  H if  the  distribution  is  quantized  into  units  sufficiently 
small  with  respect  to  S • If  the  precision  is  high  and  most 
of  the  values  of  the  discrete  variable  fall  in  one  class,  then 
H will  be  very  close  to  zero  but  H*  will  take  on  large  nega- 
tive values;  in  this  case,  therefore,  the  approximation  cannot 
bo  used. 

Let  us  see  generally  how  H*  is  related  to  H.  If  the 
axis  of  the  oontinmus  variable  is  broken  into  intervals  of 
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# 

size  A t and  X * 
X from  (1)  where 


then  H may  be  coiq[>uted  as  a function  of 


X 

X 


dt  (ic  * Of  ^ If  ^ 2f  • • •) 


The  values  of  H*(X)  and  H(X)  are  both  plotted  against  X 
on  the  accong>anylng  graph.  Per  X > 2,  that  Is,  wo 

may  for  all  practical  purposes  assvuae  H*  = H.  For  X < 0.1, 
or  A > 10  <T,  we  may  assume  H » 0.  A nomogram  for  Interme- 
diate values  of  X Is  Included. 

In  effect,  when  a distribution  Is  obtained  which  may  be 
assumed  to  be  quantized  normal,  the  associated  uncertainty  may 
be  estimated  by  obtaining  the  variance  of  the  corresponding 
continuous  normal  distribution  and  looking  up  H(X)  on  the 
graph.  Alternatively,  one  may  obtain  h'  (X)  and  look 
H(X)  on  the  nomogram. 


Sheppard's 

^2 


estimate  then  gives  <T  by 


- 


A 


where ^ Is  the  variance  of  the  quantized  distribution. 


NOMOGRAM 


Different  Methods  1 


DIFFERENT  METHODS  COMPARED 
H«  Quastler  . 

A - ” INFORMATION  TRANSMITTED"  VS.  "PERCENT  SUCCESS" 

Among  the  measures  which  can  be  used  assess  human  per- 
fomances  are  "information  transmitted"  and  "percent  success." 
The  two  measures  are  often  rovjghly  proportional. 

The  amount  of  information  transmitted  can  be  thoxight  of 
as  the  sum  of  Information  transmitted  in  successful  acts, 
minus  Information  needed  to  locate  successful  acts*  plus  in- 
formation transmitted  in  errors  (due  to  "near  misses"  aryl 
other  regularities  of  error  patterns )«.  Using  the  symbols: 

p . . • per  cent  success 

H(ln)  • • . Information  inputs  per  act 

H(loc)  ...  Information  needed  to  locate  errors » per 
act 

H(cor)  ...  Information  needed  to  correct  errors*  per 
act 

T(ln;out)  . • Information  transmitted*  per  act 


we  have 


[Information  transmitted 
^ in  successful  acts  J 

flnformatlon  transmitted 
I-  in  errors  J 


p . H(ln) 


(1-p)  . H(in)  - H(cor) 


and 


T(ln;out)  *=  H(ln)  [p+l-p]  - H(loc)  - H(cor) 


The  claim  made  is  that  often 


p . H(ln)  ^ T(in;out). 
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Now,  this  is  certainly  not  generally  true.  In  fact,  one  can 
have  perfect  transmission  with  nothing  but  errors  (e.g,,  if 
a transducer  receives  a binary  message  and  passes  it  on  after 
inverting  each  symbol);  it  is  also  possible  that  no  informa- 
tion at  all  is  carried  in  errors,  in  which  case  p • H(in)  is 
an  overestimate  of  T(in;out),  Ordinarily,  neither  extreme  is 
likely  to  occvir.  For  moderate  error  rates,  and  a moderate 
amount  of  information  carried  in  errors,  the  approximation  will 
not  bo  too  bad.  This  is  shown  graphically  in  the  figiire. 

The  following  two  extreme  examples  are  taken  from  work 
done  in  this  laboratory.  In  one  (A,  A,  Blank),  S had  to  re- 
cognize letters;  H(ln)  per  letter  was  varied  by  using  vari- 
ous constraints.  In  this  case,  there  is  very  little  informa- 
tion carried  in  errors;  in  most  cases,  S recognizes  a letter 
accxirately  or  not  at  all.  Hence,  p , H(ln)  is  larger  than 
T(ln;out): 


Input: 

Single  equlprobable 
letters 

Letters,  English 
frequencies 

Pairs  of  initial 
letters 

4-letter  words 


H(in), 
per  letter 


Sub,  A Sub,  B 

b,H(ln)  ,T(in;out)iip,H(ln)  T(in;out)* 


4.7 

4.1 

3.2 

1.8 


2.2 

2.3 

2.3 

1,6 


1.5 

1.4 

1.4 

1.0 


2.5 

2.5 

1.9 

li7 


1.9 

1.7 

0.9 


» estimated  by  Method  of  Hints 


The  next  example  (J.  W.  Osborne  and  K.  S,  Tweedell)  deals  witb 
the  task  of  locating  a marker  on  a scale.  In  this  case,  near- 
xnlsses  are  the  rule.  Thus,  errors  carry  considerable  informa- 
tion, and  p . H(in)  is  sometimes  less  than  T(ln;out); 
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no.  of  intervals 
in  scale 

H(in) 

Sub.  P 

Ip.Hdn)  T(  ins  out) 

sub.  V 

llo.H(ln)  T(  ins  out) 

16 

4*0 

3.3 

3.2 

4*0 

4.0 

20 

4.3 

3.2 

3.2 

m 

€0 

2k 

4.6 

2.3 

2.9 

4.2 

4.1 

32 

5.0 

2.7 

3.1 

3.5 

3.8 

36 

5.2 

2.9 

3.3 

3.6 

3.7 

U8 

5.5 

2.5 

3.2 

4.5 

4.3 

B « "raBOR  M4ORIT0DX*’  VS.  "HINTS" 


This  oonparlson  was  Bade  dtirlng  an  Inrestlgatlon  of  the 
aaount  of  lafomatlon  aaslBllated  from  a single-pulse  display. 

The  display  ooatalned  one  or  two  strips  or  dials*  dlTlded  into 
dlsoXHite  Intervals)  a marker  was  placed  at  the  eenter  of  any  one 
Inteznral)  the  subject  had  to  state  in  which  interval  it  was 
(J.W.  Osborne  and  K.8.  Twsedell). 

In  four  particular  runs*  the  equivooation  was  evaluated  both 
by  the  BSthod  of  hints  and  by  error  nagnltxides.  In  the  fornsr 
case*  the  procedure  fcllowed  was  that  described  by  A.  A.  Blank 
(^A  Msthtd  of  Bints*"  this  collection)}  in  the  latter  case*  the 
procedure  was  based  on  the  discrepancy  between  the  input  and  8*s 
first  stateaent.  Let  § be  the  magnitude  of  this  error*  in  scale 
intervals)  be  the  observed  frequency  of  errors  of  magnutude  i) 
and  8(1*)  the  uncertainty  concerning  the  error  magnitude)  then 


H(t)  « - ^ legg  ^ 


1*  ♦ 2* 
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If  H(Z)  would  be  evaluated  separately  for  each  output  cate- 
gory, then  it  would  be  an  accurate  estimate  of  the  equivoca- 
tion) if  all  output  categories  are  lumped  (as  in  this  case) 
it  is  a lower  boiind. 


The  following  table  shows  the  results: 


Display 

no*  of 
tests 

trials 

per 

test 

mean 

H(ln) 

mean  est*  T(in;out) 

error  maanltude 

hints 

horizontal  strip 
with  Intervals 
in  black  & white 

5 

80-120 

4.77 

3.77^^ 

3*90 

horizontal  strip 
(intervals 
blank) 

5 

80-120 

4.77 

3.47^^ 

3*52 

2 strips,  24 
Intervals , 
marked  black 
axkL  white 

8 

80 

4.58 

2*51^^ 

2*62 

2 dials,  24 
interyals 

8 

80 

4.58 

3.UU 

3.52 

1)  results  given  for  single  strip  or  dial* 

2)  terminal  Intervals  treated  separately. 

3)  no  separate  treatment  of  terminal  intervals* 

In  all  fotir  cases,  the  estimation  by  the  method  of  hints 
gave  slightly  higher  values  for  T(in;out)*  This  might  mean 
that  S has  assimilated  some  information  which  does  not  appear 
in  his  first  response,  but  is  produced  in  response  to  hints* 


0 - "SRSOR  MAG-NITUDS"  VS*  "VARUNGE" 

This  comparison  was  based  on  data  obtained  in  a recogni- 
tion experiment.  The  display  was  a dot  in  a square.  The  dot 
could  assiime  a limited  nvimber  of  positions  in  the  square* 
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The  eubjeot  was  given  & aoore  sheet  on  which  the  permissible 
positions  were  marked;  he  had  to  state  In  which  of  these  pos- 
itions the  dot  was  located.  (J.  W.*  Osborne  & K.  IS.  Tweedell). 

Ve  eraluatedt  separately*  location  vertically  and  horl- 
Bontally.  The  data  were  worked  up  by  evaluation  of  error  mag- 
nitude* as  described  above.  In  addition*  we  computed  the 
Tarli^noe*  or  mean  squared  deviation  between  Input  and  output* 
lumping  all  output  categories.  ?rom  the  variance*  the  eqid.- 
vooation  can  be  coaqputed  (see  A.  A.  Blank*  "The  TJhoertainty 
Measure  for  Quantised  Normal  Distributions"*  this  volume). 

The  following  table  shows  the  amounts  of  equivocation* 
by  the  two  alternate  methods;  each  entry  Is  based  on  40  trials; 


No*  of  possible 
positions t 

Method  of 
computing 
eauivocation 

Sub^ 

loot 

1 

2 

3 

h . .. 

1$  X 15 

1.8 

2.2 

1.6 

1.9 

T. 

2.0 

2.4 

1.6 

1.6 

19  X 19 

l*H. 

2.2 

2.0 

1.9 

2*0 

V* 

2.2 

2.2 

2.0 

1.9 

31  X 31 

l*M. 

2.9 

3.2 

2.7 

2.7 

V* 

3.2 

3.2 

2.7 

3.0 

1)  Irror  magnitude. 
2}  Varlanoe. 


One  sees  that  the  "Variance"  method  tends  to  give  slightly 
higher  values  for  the  equlvooation.  This  is  caused  by  the  . 
presence  of  a few  very  large  errors*  which  contribute  greatly 
to  the  varlanoe*  but  not  much  to  the  uncertainty  measure* 


